NSF PAR Search | NSF Public Access Repository


Multi-Scale Graph Learning for Anti-Sparse Downscaling

https://doi.org/10.1609/aaai.v39i27.35014

Fan, Yingda; Yu, Runlong; Barclay, Janet R; Appling, Alison P; Sun, Yiming; Xie, Yiqun; Jia, Xiaowei (April 2025, Proceedings of the AAAI Conference on Artificial Intelligence)

Water temperature can vary substantially even across short distances within the same sub-watershed. Accurate prediction of stream water temperature at fine spatial resolutions (i.e., fine scales, ≤ 1 km) enables precise interventions to maintain water quality and protect aquatic habitats. Although spatiotemporal models have made substantial progress in spatially coarse time series modeling, challenges persist in predicting at fine spatial scales due to the lack of data at that scale. To address the problem of insufficient fine-scale data, we propose a Multi-Scale Graph Learning (MSGL) method. This method employs a multi-task learning framework where coarse-scale graph learning, bolstered by larger datasets, simultaneously enhances fine-scale graph learning. Although existing multi-scale or multi-resolution methods integrate data from different spatial scales, they often overlook the spatial correspondences across graph structures at various scales. To address this, our MSGL introduces an additional learning task, cross-scale interpolation learning, which leverages the hydrological connectedness of stream locations across coarse- and fine-scale graphs to establish cross-scale connections, thereby enhancing overall model performance. Furthermore, we have broken free from the mindset that multi-scale learning is limited to synchronous training by proposing an Asynchronous Multi-Scale Graph Learning method (ASYNC-MSGL). Extensive experiments demonstrate the state-of-the-art performance of our method for anti-sparse downscaling of daily stream temperatures in the Delaware River Basin, USA, highlighting its potential utility for water resources monitoring and management.
Free, publicly-accessible full text available April 11, 2026
StreamPULSE Sensor Data and Metabolism Estimates for Rivers and Streams

https://doi.org/10.4211/hs.0c9c4c2fe8914690af8b3c8495503d7c

Vlah, Michael; Anderson, Steven; Appling, Alison; Arroite, Maite; Berdanier, Aaron; Blaszczak, Joanna; Bonjour, Sophia; Carter, Alice; Chamberlin, Catherine; Cohen, Matthew; et al (September 2025, Hydroshare)

Stream metabolism, encompassing gross primary production and ecosystem respiration, reflects the fundamental energetic dynamics of freshwater ecosystems. These processes regulate the concentrations of dissolved gases like oxygen and carbon dioxide, which in turn shape aquatic food webs and ecosystem responses to stressors such as floods, drought, and nutrient loading. Historically difficult to quantify, stream metabolism is now measurable at high temporal resolution thanks to advances in sensor technology and modeling. The StreamPULSE dataset includes high-frequency sensor data, metadata, and modeled estimates of ecosystem metabolism. This living dataset contributes to a growing body of open-access data characterizing the metabolic pulse of stream ecosystems worldwide. To contribute to StreamPULSE, visit data.streampulse.org. All data contributed to StreamPULSE become public after an optional embargo period. Use this publication to access annual data releases, or use data.streampulse.org to download new data as they become available.
Physics-guided machine learning from simulated data with different physical parameters

https://doi.org/10.1007/s10115-023-01864-z

Chen, Shengyu; Kalanat, Nasrin; Xie, Yiqun; Li, Sheng; Zwart, Jacob A.; Sadler, Jeffrey M.; Appling, Alison P.; Oliver, Samantha K.; Read, Jordan S.; Jia, Xiaowei (August 2023, Knowledge and Information Systems)

Full Text Available
Deep learning approaches for improving prediction of daily stream temperature in data‐scarce, unmonitored, and dammed basins

https://doi.org/10.1002/hyp.14400

Rahmani, Farshid; Shen, Chaopeng; Oliver, Samantha; Lawson, Kathryn; Appling, Alison (September 2021, Hydrological Processes)
Basin-centric long short-term memory (LSTM) network models have recently been shown to be an exceptionally powerful tool for stream temperature (Ts) temporal prediction (training in one period and making predictions for another period at the same sites). However, spatial extrapolation is a well-known challenge to modeling Ts and it is uncertain how an LSTM-based daily Ts model will perform in unmonitored or dammed basins. Here we compiled a new benchmark dataset consisting of >400 basins across the contiguous United States in different data availability groups (DAG, meaning the daily sampling frequency) with or without major dams and studied how to assemble suitable training datasets for predictions in basins with or without temperature monitoring. For prediction in unmonitored basins (PUB), LSTM produced an RMSE of 1.129 °C and R² of 0.983. While these metrics declined from LSTM's temporal prediction performance, they far surpassed traditional models' PUB values, and were competitive with traditional models' temporal prediction on calibrated sites. Even for unmonitored basins with major reservoirs, we obtained a median RMSE of 1.202 °C and an R² of 0.984. For temporal prediction, the most suitable training set was the matching DAG that the basin could be grouped into, e.g., the 60% DAG for a basin with 61% data availability. However, for PUB, a training dataset including all basins with data is consistently preferred. An input-selection ensemble moderately mitigated attribute overfitting. Our results indicate there are influential latent processes not sufficiently described by the inputs (e.g., geology, wetland covers), but temporal fluctuations are well predictable, and LSTM appears to be a highly accurate Ts modeling tool even for spatial extrapolation.
Full Text Available
Partial Differential Equation Driven Dynamic Graph Networks for Predicting Stream Water Temperature

https://doi.org/10.1109/ICDM51629.2021.00011

Bao, Tianshu; Jia, Xiaowei; Zwart, Jacob; Sadler, Jeffrey; Appling, Alison; Oliver, Samantha; Johnson, Taylor T. (December 2021, 2021 IEEE International Conference on Data Mining (ICDM))

Full Text Available
Physics-Guided Machine Learning from Simulation Data: An Application in Modeling Lake and River Systems

https://doi.org/10.1109/ICDM51629.2021.00037

Jia, Xiaowei; Xie, Yiqun; Li, Sheng; Chen, Shengyu; Zwart, Jacob; Sadler, Jeffrey; Appling, Alison; Oliver, Samantha; Read, Jordan (January 2022, 2021 IEEE International Conference on Data Mining (ICDM))

Full Text Available
Long‐term change in metabolism phenology in north temperate lakes

https://doi.org/10.1002/lno.12098

Ladwig, Robert; Appling, Alison P.; Delany, Austin; Dugan, Hilary A.; Gao, Qiantong; Lottig, Noah; Stachelek, Jemma; Hanson, Paul C. (January 2022, Limnology and Oceanography)

Full Text Available
Modeling dataset: Long-term Change in Metabolism Phenology across North-Temperate Lakes, Wisconsin, USA 1979-2019

https://doi.org/10.6073/pasta/af991c26bace5af8d4b3bb66d7b18af7

Ladwig, Robert; Appling, Alison P.; Delany, Austin; Dugan, Hilary A.; Gao, Qiantong; Lottig, Noah; Stachelek, Joseph; Hanson, Paul C. (January 2022, Environmental Data Initiative)

This dataset includes model configurations, scripts and outputs to process and recreate the outputs from Ladwig et al. (2021): Long-term Change in Metabolism Phenology across North-Temperate Lakes. The provided scripts will process the input data from various sources, as well as recreate the figures from the manuscript. Further, all output data from the metabolism models of Allequash, Big Muskellunge, Crystal, Fish, Mendota, Monona, Sparkling and Trout are included.
Modeling dataset: Long-term Change in Metabolism Phenology across North-Temperate Lakes, Wisconsin, USA 1979-2019

https://doi.org/10.6073/pasta/bf44814c101e7e085b2663556a9188ca

Ladwig, Robert; Appling, Alison P.; Delany, Austin; Dugan, Hilary A.; Gao, Qiantong; Lottig, Noah; Stachelek, Joseph; Hanson, Paul C. (January 2022, Environmental Data Initiative)

This dataset includes model configurations, scripts and outputs to process and recreate the outputs from Ladwig et al. (2021): Long-term Change in Metabolism Phenology across North-Temperate Lakes. The provided scripts will process the input data from various sources, as well as recreate the figures from the manuscript. Further, all output data from the metabolism models of Allequash, Big Muskellunge, Crystal, Fish, Mendota, Monona, Sparkling and Trout are included.
Differentiable modelling to unify machine learning and physical models for geosciences

https://doi.org/10.1038/s43017-023-00450-9

Shen, Chaopeng; Appling, Alison P.; Gentine, Pierre; Bandai, Toshiyuki; Gupta, Hoshin; Tartakovsky, Alexandre; Baity-Jesi, Marco; Fenicia, Fabrizio; Kifer, Daniel; Li, Li; et al (July 2023, Nature Reviews Earth & Environment)

Process-based modelling offers interpretability and physical consistency in many domains of geosciences but struggles to leverage large datasets efficiently. Machine-learning methods, especially deep networks, have strong predictive skills yet are unable to answer specific scientific questions. In this Perspective, we explore differentiable modelling as a pathway to dissolve the perceived barrier between process-based modelling and machine learning in the geosciences and demonstrate its potential with examples from hydrological modelling. ‘Differentiable’ refers to accurately and efficiently calculating gradients with respect to model variables or parameters, enabling the discovery of high-dimensional unknown relationships. Differentiable modelling involves connecting (flexible amounts of) prior physical knowledge to neural networks, pushing the boundary of physics-informed machine learning. It offers better interpretability, generalizability, and extrapolation capabilities than purely data-driven machine learning, achieving a similar level of accuracy while requiring less training data. Additionally, the performance and efficiency of differentiable models scale well with increasing data volumes. Under data-scarce scenarios, differentiable models have outperformed machine-learning models in producing short-term dynamics and decadal-scale trends owing to the imposed physical constraints. Differentiable modelling approaches are primed to enable geoscientists to ask questions, test hypotheses, and discover unrecognized physical relationships. Future work should address computational challenges, reduce uncertainty, and verify the physical significance of outputs.
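Illustrative note (not from the paper): the abstract's definition of "differentiable" as computing gradients with respect to model parameters can be sketched with a toy example. The code below is a minimal sketch under stated assumptions, not the authors' method: the linear-reservoir process model, the `Dual` forward-mode autodiff class, the rainfall values, and the function names `simulate`, `loss`, and `calibrate` are all hypothetical and chosen only for illustration. It differentiates a small process-based simulation with respect to its one parameter `k` and uses that gradient to calibrate `k` against synthetic observations.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Dual:
    """A value together with its derivative w.r.t. one chosen parameter."""
    val: float
    der: float = 0.0

    def __add__(self, other):
        other = _lift(other)
        return Dual(self.val + other.val, self.der + other.der)

    __radd__ = __add__

    def __sub__(self, other):
        other = _lift(other)
        return Dual(self.val - other.val, self.der - other.der)

    def __rsub__(self, other):
        return _lift(other) - self

    def __mul__(self, other):
        other = _lift(other)
        # Product rule: (uv)' = u'v + uv'
        return Dual(self.val * other.val,
                    self.der * other.val + self.val * other.der)

    __rmul__ = __mul__


def _lift(x):
    """Wrap plain numbers as constants (zero derivative)."""
    return x if isinstance(x, Dual) else Dual(float(x))


def simulate(k, rain, s0=10.0):
    """Hypothetical linear reservoir: outflow = k * storage at each step."""
    storage, flows = s0, []
    for p in rain:
        q = k * storage
        storage = storage + p - q
        flows.append(q)
    return flows


def loss(k, rain, observed):
    """Mean squared error between simulated and observed outflow."""
    total = 0.0
    for q, obs in zip(simulate(k, rain), observed):
        err = q - obs
        total = total + err * err
    return total * (1.0 / len(observed))


def calibrate(rain, observed, k=0.5, lr=0.01, steps=300):
    """Gradient descent on k, with the gradient propagated through the simulation."""
    for _ in range(steps):
        grad = loss(Dual(k, 1.0), rain, observed).der  # seed dk/dk = 1
        k -= lr * grad
    return k


# Synthetic "observations" generated with an assumed true k of 0.3.
rain = [2.0, 0.0, 5.0, 1.0, 0.0, 3.0]
observed = simulate(0.3, rain)
k_hat = calibrate(rain, observed)
print(f"recovered k = {k_hat:.4f}")
```

In practice an automatic-differentiation framework would replace the hand-written `Dual` class, and a neural network could stand in for parts of the process model; the sketch only shows that the gradient flows through the physical simulation itself.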
Full Text Available
